NSF PAR Search | NSF Public Access Repository

An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape

https://doi.org/10.1109/SP54263.2024.00194

Abdullah, Sifat Muhammad; Cheruvu, Aravind; Kanchi, Shravya; Chung, Taejoong; Gao, Peng; Jadliwala, Murtuza; Viswanath, Bimal (May 2024, IEEE Security & Privacy (Oakland) 2024)

Full Text Available
Measurement of Embedding Choices on Cryptographic API Completion Tasks

https://doi.org/10.1145/3625291

Xiao, Ya; Song, Wenjia; Ahmed, Salman; Ge, Xinyang; Viswanath, Bimal; Meng, Na; Yao, Danfeng Daphne (March 2024, ACM Transactions on Software Engineering and Methodology)

In this article, we conduct a measurement study to comprehensively compare the accuracy impacts of multiple embedding options in cryptographic API completion tasks. Embedding is the process of automatically learning vector representations of program elements. Our measurement focuses on design choices of three important aspects: program analysis preprocessing, token-level embedding, and sequence-level embedding. Our findings show that program analysis is necessary even under advanced embedding. The results show 36.20% accuracy improvement, on average, when program analysis preprocessing is applied to transfer bytecode sequences into API dependence paths. With program analysis and the token-level embedding training, the embedding dep2vec improves the task accuracy from 55.80% to 92.04%. Moreover, only a slight accuracy advantage (0.55%, on average) is observed by training the expensive sequence-level embedding compared with the token-level embedding. Our experiments also suggest the differences made by the data. In the cross-app learning setup and a data scarcity scenario, sequence-level embedding is more necessary and results in a more obvious accuracy improvement (5.10%).
Full Text Available
A First Look at Toxicity Injection Attacks on Open-domain Chatbots

https://doi.org/10.1145/3627106.3627122

Weeks, Connor; Cheruvu, Aravind; Abdullah, Sifat Muhammad; Kanchi, Shravya; Yao, Daphne; Viswanath, Bimal (December 2023, Proceedings of the 39th Annual Computer Security Applications Conference (ACSAC))

Chatbot systems have improved significantly because of the advances made in language modeling. These machine learning systems follow an end-to-end data-driven learning paradigm and are trained on large conversational datasets. Imperfections or harmful biases in the training datasets can cause the models to learn toxic behavior, and thereby expose their users to harmful responses. Prior work has focused on measuring the inherent toxicity of such chatbots, by devising queries that are more likely to produce toxic responses. In this work, we ask the question: How easy or hard is it to inject toxicity into a chatbot after deployment? We study this in a practical scenario known as Dialog-based Learning (DBL), where a chatbot is periodically trained on recent conversations with its users after deployment. A DBL setting can be exploited to poison the training dataset for each training cycle. Our attacks would allow an adversary to manipulate the degree of toxicity in a model and also enable control over what type of queries can trigger a toxic response. Our fully automated attacks only require LLM-based software agents masquerading as (malicious) users to inject high levels of toxicity. We systematically explore the vulnerability of popular chatbot pipelines to this threat. Lastly, we show that several existing toxicity mitigation strategies (designed for chatbots) can be significantly weakened by adaptive attackers.
Specializing Neural Networks for Cryptographic Code Completion Applications

https://doi.org/10.1109/TSE.2023.3265362

Xiao, Ya; Song, Wenjia; Qi, Jingyuan; Viswanath, Bimal; McDaniel, Patrick; Yao, Danfeng (January 2023, IEEE Transactions on Software Engineering)

Full Text Available
Poster: Comprehensive Comparisons of Embedding Approaches for Cryptographic API Completion

https://doi.org/10.1109/ICSE-Companion55297.2022.9793808

Xiao, Ya; Ahmed, Salman; Ge, Xinyang; Viswanath, Bimal; Meng, Na; Yao, Danfeng Daphne (May 2022, 2022 IEEE/ACM 44th International Conference on Software Engineering: Companion Proceedings (ICSE-Companion))

Full Text Available
Throwing Darts in the Dark? Detecting Bots with Limited Data Using Neural Data Augmentation

https://doi.org/10.1109/SP40000.2020.00079

Jan, Steve T.K.; Hao, Qingying; Hu, Tianrui; Pu, Jiameng; Oswal, Sonal; Wang, Gang; Viswanath, Bimal (January 2020, The 41st IEEE Symposium on Security and Privacy (IEEE SP))

Full Text Available
Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks

https://doi.org/10.1109/SP.2019.00031

Wang, Bolun; Yao, Yuanshun; Shan, Shawn; Li, Huiying; Viswanath, Bimal; Zheng, Haitao; Zhao, Ben Y. (May 2019, IEEE Symposium on Security and Privacy)

Full Text Available
With Great Training Comes Great Vulnerability: Practical Attacks against Transfer Learning

Wang, Bolun; Yao, Yuanshun; Viswanath, Bimal; Zheng, Haitao; Zhao, Ben Y. (August 2018, Proceedings of the 27th USENIX Security Symposium)

Full Text Available
Automated Crowdturfing Attacks and Defenses in Online Review Systems

https://doi.org/10.1145/3133956.3133990

Yao, Yuanshun; Viswanath, Bimal; Cryan, Jenna; Zheng, Haitao; Zhao, Ben Y. (October 2017, Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security)

Full Text Available
